Are stable instances easy? 



Yonatan Bilu * 
Mobileye Vision Technologies Ltd. 
12 Hartom Street, PO Box 45157 
Jerusalem, 91450 Israel, 
yonatan.bilu @ mobileye.com 



Nathan Linial * 
Institute of Computer Science 
Hebrew University 
Jerusalem 91904, Israel. 
nati@cs.huji.ac.il 



June 17, 2009 



Abstract 



We introduce the notion of a stable instance for a discrete optimization problem, and argue that 
in many practical situations only sufficiently stable instances are of interest. The question then arises 
whether stable instances of NP-hard problems are easier to solve. In particular, whether there exist 
algorithms that solve correctly and in polynomial time all sufficiently stable instances of some NP-hard 
problem. The paper focuses on the Max-Cut problem, for which we show that this is indeed the case. 

1 Introduction 

Computational complexity theory as we know it today is concerned mostly with worst-case analysis of 
computational problems. For example, we say that a problem is NP-hard if the existence of an algorithm 
that correctly decides every instance of the problem implies that SAT can be decided in a polynomially 
equivalent time complexity. However, the study of decision and optimization problems is motivated not 
merely by theoretical considerations. Much of our interest in such problems arises because they formalize 
certain real-world tasks. From this perspective, we are not interested in all problem instances, but only in 
those which can actually occur in reality. 

This is often the case with clustering problems, which are ubiquitous in most fields of engineering, exper- 
imental and applied science. Any concrete formulation of the clustering problem is likely to be NP-hard. 
However this does not preclude the possibility that the problem can be solved efficiently in practice. In fact, 
in numerous application areas, large-scale clustering problems are solved on a regular basis. As mentioned 
above, we are only interested in instances where the data is actually made up of fairly well-defined clusters 
- the instances where solving the problem is interesting from the practical perspective. 

Put differently, the usual way for proving that clustering is NP-hard is by a reduction to, say, SAT. This 
reduction entails the construction of instances for the clustering problem, such that the existence of an 
algorithm that can solve all of them efficiently implies the existence of an algorithm that efficiently solves 
SAT. However, it may well be the case that all these instances are clearly artificial, and solving them is of no 
practical interest. 

*This research is supported by grants from the binational Science Foundation Israel-US and the Israel Science Foundation. 
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As a concrete example, consider the problem of clustering protein sequences into families. Out of the 
enormous space of all possible sequences, only a tiny fraction is encountered in nature, and it is only about 
these (or slight modifications thereof) that we actually care. 

Our case in point is the Max-Cut problem, which can be thought of as a clustering into two clusters. It is 
well known that this problem is NP-complete, and so it is believed that there is no algorithm that solves 
it correctly on all graphs, in polynomial time. In this work we strive to identify properties of instances of 
the Max-Cut problem (i.e., of weighted graphs), which capture the notion that the input has a well-defined 
structure w.r.t Max-Cut (i.e., the maximal cut "stands out" among all possible cuts). Our goal is to show that 
Max-Cut can be solved efficiently on inputs that have such properties. 

Consideration of a similar spirit have led to the development of Smoothed Analysis initiated in fTSl, (see 
116] for some of the exciting developments in that area. The similarity has two main facets: (i) Both lines of 
research attempt to investigate the computational complexity of problems from a non- worst-case perspective, 
(ii) Both are investigations of the geometry of the instance space of the problem under consideration. The 
goal being to discover interesting parts of this space in which the instances have complexity lower than the 
worst case. Viewed from this geometric perspective, the set-up that we study here is very different than 
what is done in the theory of smoothed analysis. There one shows that the hard instances form a discrete 
and isolated subset of the input space. Consequently, for every instance of the problem, a small random 
perturbation is very likely to have low computational complexity. In the problems that we study here the 
situation is radically different. The "interesting" instances {stable instances as we shall call them) are very 
rare. Indeed, it is not hard to show that under reasonable models of random instances the probability that 
a random instance be stable is zero, or at least tends to zero as the problem size grows. What we wish to 
accomplish is to efficiently solve all instances within this subspace. We claim that this tiny set is interesting 
because it includes all realistic clustering problems. 

The notion of stability is central to our work. This is a concrete way to formalize the notion that the 
only instances of interest are those for which small perturbation in the data (which may reflect e.g. some 
measurement errors) do not change the optimal partition of the graph. 

Definition 1.1. Let W be an n ^ n symmetric, non-negative matrix. A 7-perturbation ofW, for 7 > 1, is 
an n X n matrix W' such that Wi, j = 1, . . . , n, Wij < j < 7 • Wij. 

Let (S, [n]\S) be a maximal cut ofW, i.e. a partition that maximizes X^igs instance W (of 

the Max-Cut problem) is said to be '^-stable, if for every ^-perturbation W' ofW, (S, [n]\S) is the unique 
maximal cut of W. 

However this definition is, perhaps, not sufficient. Consider two bipartite graphs which are joined togther 
by a single edge. The resulting graph is 7-stable for all 7, but the alignment of the two bipratite graphs 
with respect to one another completely depends on the adjoining edge. Hence, to better capture our intution 
of what it means for a solution to be stable, it is reasonable to demand that in addition to stability the 
graph contains no small cuts. We show that the combination of both these properties indeed allows solving 
Max-Cut efficiently (Example 1431). 

In section [3] we present an algorithm that solves correctly and in polynomial time 7-stable instances of Max- 
Cut: (i) On simple graphs of minimal degree 6, when 7 > and (ii) On weighted graphs of maximal 
degree A when 7 > VAn. In section |4] we explore several spectral conditions which make Max-Cut 
amenable on stable instances. This involves analyzing the spectral partitioning heuristic for Max-Cut. In 
particular, we show that Max-Cut can be solved efficiently on (locally) stable expander graphs, and on 
graphs where the solution is sufficiently distinct from all other cuts. We conclude by deducing an improved 
approximation bound for the Goemans-Williamson algorithm on stable instances, and by showing that Max- 
Cut is easy in a certain random model for such instances. 
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Finally, we should mention that this is just a first step. In particular, it is of great interest to study more 
permissive notions of stability where a small perturbation can slightly modify the optimal solution. There 
are also other natural ways to capture the concept of stability. Similar considerations can be applied to many 
other optimization problems. Some of these possibilities are briefly discussed below, but these questions are 
mostly left for future investigations. 

2 Preliminaries 

2.1 Notation 

Throughout the paper we denote the vertex set of the graph G under discussion by [n] . A vector G K" 
induces the partition of [n] into the sets, ({i : > 0}, {i : < 0}). Viewed as a partition of G's vertex 
set, we call it the cut induced by v in G. 

The indicator vector of a partition {S, S) of [n] (or a cut in G), is the vector v G {—1, 1}", with = 1 iff 
i G S. 

For a weighted graph G, we denote the indicator vector of its maximal cut by mc*. We generally assume 
that this cut is unique, otherwise mc* is an indicator of some maximal cut. 

For a subset Ac [n], we denote A = [n] \A. 

For two disjoint subsets of vertices in the graph, A, B, we denote by E{A, B) the set of edges going between 
them, and w{A, B) = '}2{ij)eE{A b) With a shght abuse of notation, we denote w{i) = J2j Wij. 
Finally, for a set of edges F c E, denote w{F) = j)&F 

We switch freely between talking about the graph and about its associated weight matrix. Given a symmetric 
nonnegative n x n matrix W with zero trace (as input to the max cut problem), we define its support as a 
graph G = (V, E) with vertex set V = [n] where (i, j) G E iff Wij > 0. 

2.2 Properties and equivalent definitions 

A useful way to think of 7-stability is as a game between two (computationally unbounded) players. Measure 

and Noise: Given a graph G, Measure chooses a cut {S, S). Noise then multiplies weights of his choice 
by factors between 1 and 7, obtaining a graph G' (over the same vertex and edge sets, but with possibly 
different weights). He then chooses a different cut, (T, T). Noise wins if in G' w{T,T) > w{S,S). 
Otherwise, Measure wins. A graph is 7-stable if Measure has a winning strategy. 

Observe that the players' strategy is clear: Measure chooses the maximal cut, and Noise , w.l.o.g., multiphes 
by 7 the weights of the edges in E{T,T)\E{S, S). Multiplying weights of other edges either does not 
change w{T, T) — w{S, S), or decreases it. Hence, we arrive at an equivalent definition for 7-stability is: 

Proposition 2.1. Let 7 > 1. A graph G graph with maximal cut {S, S) is j-stable (w.r.t. Max-Cut) if for 
every vertex set T ^ S,S, 

w{E{S, S)\E{T, f )) > 7 • wiE{T, f)\E{S, S)). 

This view of stability suggests how 7-stable graphs can be generated: Let G' be a 7'-stable graph. Multiply- 
ing the weights of all the edges in the maximum cut by ^ yields a 7-stable graph G. Moreover, it is not hard 
to see that all 7-stable graphs can be obtained this way. In other words, in the following random process 
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every 7-stable graph on n vertices has a positive probability: Generate a random graph on n vertices, say 
according to G{n,p) (for some p / 0, 1); Find the maximal cut and its stability 7'. Multiply all cut edges 
by ^. Note, however, that in this naive model the maximal cut can be easily identified by simply examining 
edge weights - those of weight ^ are the cut edges. 

One pleasing aspect of 7-stability is that it is oblivious to scale - multiplying all weights in a graph by a 
constant factor does not change its stability. This can be readily seen from Proposition 12.11 It may seem 
natural to define 7 two-way stability as robustness to perturbation by a multiplicative factor between ^ and 
7 (so called, two-way perturbation). But obliviousness to scale easily implies that a graph is 7 two-way 
stable iff it is 7^-stable. 

It is also natural to consider a solution as "interesting" if it stands out among all the alternatives. Let {S, S) 
be a maximal cut in a graph G, and consider an alternative cut, (T, T). Consider the set E{S, S)AE(T, T) 
of those edges on which the two cuts "disagree". We seek to measure the difference between the cuts (S*, S) 
and (T, f) relative to the size of w{E{S, S)AE(T, f)). So say that (S, S) is a edge distinct (with a > 0), 
if for any T CV, 

w{S,S) -w{T,T) > a-w{E{S,S)AE{T,f)). 

Now, denote Wt = w{E{T, f)\E{S, S)) and Ws = w{E{S, S)\E{T, f )). If G is a edge distinct then 

Ws-Wt = w{S, S) - w{T, f ) > a • w{E{S, S)AE{T, f)) = a- {Ws + Wt). 

Hence, Ws > j^^t, and by Proposition I2.1I G' is j^-stable. Similarly, if G is 7-stable, then it is ^^^1 
edge distinct. 

2.3 Variations on a theme 

We shall also be interested in a weaker version of stability, which proves useful for some of the results in 
sequel: 

Definition 2.1. Let W be an instance of the Max-Cut problem and let (S, S) be its optimal partition. We 
say that W is 7-locally stable if for all v ^ S 

l-^Wu,v <^Wu,v, 

and for all v £ S 

l-^Wu,v < ^Wu,v, 

Observe that every 7-stable graph is also 7-locally stable - this follows from Definition 12.11 with T being a 
single vertex. 

It is essentially known that Max-Cut is NP-hard even when restricted to 7-locally stable instances (for 7 at 
most exponential in the size of the input) fT3lf^ In fact, one can impose local stability, without altering the 

'The NP-completeness of Max-Cut can be shown by a reduction from 3-Not-all-equal SAT: Construct a graph over the formula's 
hterals, and for every 3-clause define three edges (a triangle) connecting the clause's literals. It is not hard to see that the formula 
is satisfyable iff the graph's Max-Cut's value is twice the number of clauses. It is also not hard to see that if this is indeed the case, 
the cut is 2-locally stable. Furthermore, by adding edges between a literal and its negation, the structure of the Max-Cut does not 
change, and local stability increases. 
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overall stability: Let G be a graph with weighted adjacency matrix W. Let be a graph on V x {0, 1}, 
with weighted adjacency matrix: 

\T-w{i)-I W J 

(for some r > 1.) 

It is not hard to see that the maximal cut in consists of two copies of that in G. Specifically, {S, S) is a 

maximal cut in G iff (5 x {0} U 5 x {1}, 5 x {1} U 5 x {0}) is a maximal cut in G^ . 

It is also not hard to see that G is 7-stable, iff G^ is, and that G^ is at least 2T-locally stable. 

The definition of stability via edge distinctness formalizes the notion that in instances of interest, the Max- 
Cut should be distinctly better than all other cuts. Clearly, cuts which differ only slightly from the maximum 
one in structure can only differ slightly in value, so the difference in value should be quantified in terms of 
of the distance between the cuts. 

Definition 2.2. Let (S, S) be a cut in a (weighted) graph G = (V, E) and k > 0. We say that this cut is 
A;-distinct if for any cut (T, T), 

w{e{S,S)) -w{e{T,T) > A;min{|SAr|, |5Af|}. 

We say that a graph is (k, 7)-distinct (w.rt. Max-Cut) if its maximal cut is k-distinct and ^-locally stable. 

In example l44l we show that Max-Cut can be solved on {k, 7)-distinct instance when k and 7 are sufficiently 
large. 

3 Combinatorial approach 

One approach in solving a Max-Cut problem is to identify a pair of vertices which must to be on the same 
side of the optional cut (e.g. in a simple graph, two vertices with the same neighborhood). Two such vertices 
can be safely merged into a single vertex - keeping multiple edges. If this can be repeated until a bipartite 
graph is obtained, then the problem is solved. 

Observe that if G is a 7-stable graph, and i, j are two vertices on the same side of the maximal cut, then 
the graph G', obtained from G by merging i and j into a single vertex i', is 7-stable as well. Indeed, any 
7-perturbation of G' induces a 7-perturbation of G over the same edges. If as a result of this perturbation the 
maximal cut changes in G', then this new cut is also maximal in the similarly perturbed G, since it contains 
the same edges (in contradiction with G being 7-stable). 

This observation implicitly guides the first algorithm presented below. In it we identify pairs of vertices 
which are on opposite sides of the maximal cut. By continuing to do so, we grow bigger and bigger con- 
nected bipartite subgraphs, until they all connect. In the second algorithm we explicitly merge together 
vertices on the same side as long as we know how to, and then, once we have a much smaller graph, use the 
first algorithm. 

3.1 An efficient algorithm for n-stable instances 

We start by describing an algorithm, that solves the Max-Cut problem on (weighted) graphs of maximal 
degree A which are V An-stable. The idea is to iteratively identify sets of edges which belong to the 
maximal cut. When they form a connected spanning bipartite graph, the maximal cut is found. 
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FindMaxCut(G) (G is a weighted graph) 

1. Initiahze T — (F(G), 0). Throughout the algorithm T will be a bipartite subgraph of G. 

2. While T is not connected, do: 

(a) Let Gi , . . . , Gt be the connected components of T. Each of them is a bipartite graph, 
with vertex bipartition V{Ci) — {Li, Ri). 

(b) Let Gi* be a component with the least number of vertices. For each j — 1, . . . ,t, 

j / r, let E'^ = E{Li^,Lj) U E(Ri,,Rj) and E] = E(Li^,Rj) U E{Ri<-,Lj). 
Let j* and c* be such that the weight of Ej, is the largest among all Ej. 

(c) Add the edges of Ef. to T 

3. Output the cut defined by the two sides of T. 

Theorem 3.1. There is an algorithm that solves correctly and in polynomial time every instance of weighted 
Max-Cut that is ■^-stable for every 7 > \/An. Here an instance is an n-vertex graph of maximal degree A. 

Proof: We will show that the above algorithm is well defined, and outputs the correct solution on ^/nK- 
stable instances of Max-Cut. Let (5, S) be the maximal cut. We maintain that throughout the algorithm, S 
separates each connected component Cj = {Li^Ri). Namely, either Li C S, Ri C V\S or Ri C S, Li C 
V\S. 

This clearly holds at the outset. If it holds at termination, the algorithm works correctly. So consider the 
first iteration when this does not hold. Let Ci* be a smallest connected component at this stage, and denote 
k = \Ci*\. Up to this point our assumption holds, so say Lj* C 5 and Ri* n = 0. Let j* and c* 
be those chosen as in step |2b] Since this is the point where the algorithm errs, Ej, is added to T, yet 

Ef n E{S, S) = 0. 

Now consider the 7-perturbation of the graph obtained by multiplying the edges in Ef by 7. If the original 
graph is 7-stable, the maximal cut of the perturbed graph is (5, S) as well. Consider the cut obtained by 
flipping the sides of Lj. and Ri*. That is, denote Z = S\Li* U Ri*, and consider the cut (Z, Z). 
The cut (Z, Z) contains the edges Ej, , which {S, S) does not. For each j / j*, let cj be such that Ef is in 
the cut {S, S) (we'll be interested only in non-empty subsets). In the extreme case, all these edges ai^e not 
in the cut (Z, Z). Observe that all other edges in £'(£', 5) are also in E{Z,Z). 

Define J = {j ^ i : Ef / 0}. Since the weight of (Z, Z), even in the perturbed graph, is smaller than that 
of {S, S), we have that: 

j-w{Ef)<^wiE;^). 

(The l.h.s. is a lower bound on what we gain when we switch from S to Z, and the r.h.s. is an upper 
bound on the loss.) Recall that Ef was chosen to be the set of edges with the largest total weight. Hence, 

T.jeJ^i^j'') - l'^l'^(^i')' so 7 < I J|. Clearly, \J\ < min{f , kA}, and so: 

7^ < ^kA = nA. 

k 

This is a contradiction to the assumption that the input is V nA-stable. | 

Note that we have actually proven that the algorithm works as long as it can find a connected component 
d*, such that \{i : E^ / 0}| < 7, for a = 0,1. 

The concept of stability clearly applies to other combinatorial optimization problems. Similarly, the al- 
gorithm above can be adjusted to solve highly stable instances of other problems. For example, a similar 
algorithm finds the optimal solution to (weighted) V nA-stable instances of the Multi-way Cut problem, and 
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A-stable instances of the Vertex Cover problem (where again n is the number of vertices in the graph, and 
A the maximal degree). 



3.2 An efficient algorithm for simple graphs of high minimal degree 

A complementary approach is useful when the graph is unweighted, and of high minimal degree. Suppose 
a 7-stable graph, for some big (but bounded) 7 has minimal degree n/2. Then by local stability each side 
in the maximal cut must be of size nearly n/2, and the neighborhoods of any two vertices on the same side 
have most of their vertices in common. Thus we can easily cluster together the vertices into the two sides 
of the maximal cut. Even when the minimal degree is lower, we can use the same scheme to obtain several 
clusters of vertices which are certain to be on the same side, and then use the algorithm from the previous 
subsection to find the maximal cut. 

Theorem 3.2. There is an algorithm that solves correctly and in polynomial time every instance of un- 
weighted Max-Cut that is ^-stable for every 7 > Here an instance is an n-vertex graph G = (y, E) of 
minimal degree 5. Furthermore, if 5 = ^(j^^). then 'j-local stability suffices. 

It clearly suffices to consider 7 = Let N-i C V he the neighbor set of vertex i and di = \Ni\. Define 

i7 to be a graph on V with i, j adjacent if | A^j n Nj\ > Since G is in particular 7-locally stable, 

every vertex i has at most of its neighbors on its own side of the maximal cut. Hence, the vertices of 
each connected component of H must be on the same side of the maximal cut. 

Let c be the number of connected components in H and let [/ C F be a set of c vertices, with exactly one 
vertex from each of these connected components. Let the degrees of the vertices in U be di^ < di^ < ■ ■ ■ < 
di^. For any u,v £ U we have that | A^^ n A^y| < HHSMii^y: \Ye claim that c < 7. If this is not the case, let 
us apply the inclusion-exclusion formula and conclude: 



7 7 i-i , 7 -I i-i J 7 • -I 

1 1 \N,\ > y{d,.-y^) = y ^) > y a - ^) 

T ri7 + i ^ ' 7 + ' 7 + 1 

1 j=l k=l j=l k=l ^ j=l 

since, by assumption di^ < di^ for k < j. Also, d,^ > 6 for all j, and clearly | |JJ A^J < n. Therefore, 
n >llV.I> 5^(1-^) = 5(7-4^) >- 

' Y ^ ^ 7 + 1 2 7 + 1^ - 2 

1 j=i 

a contradiction which implies c < 7. 

Now consider the graph G' obtained from G by contracting all vertices in each Cj into a single vertex, 
keeping multiple edges. By our previous observation, G' has the same max-cut value as G. Consequently, 
as discussed at the beginning of this section, the graph G' is 7-stable. It follows that G' is a weighted graph 
whose stability exceeds its number of vertices. By Theorem 13. 11 the optimal cut in G' (and hence in G) can 
be found in polynomial time, as claimed. 

It is also worth mentioning that if 6 = ^ii^^) then 7 is 0(log n), and we can find the maximal cut in G' 
by going over all cuts. Moreover, in this case it suffices to assume that G is 7-locally stable. | 
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4 A spectral approach 



4.1 Definitions 

Spectral partitioning is a general name for a number of heuristic methods for various graph partitioning 
problems which are popular in several application areas. The common theme is to consider an appropriate 
eigenvector of a possibly weighted adjacency matrix of the graph in question, and partition the vertices 
according to the corresponding entries. Why is it at least conceivable that such an approach should yield a 
good solution for Max-Cut? The Max-Cut problem can can clearly be formulated as: 

(«j)g£; 

The Goemans-Williamson algorithm |7| works by solving an SDP relaxation of the problem. In other words, 
where as above we multiply the matrix by a rank 1 PSD matrix, in the SDP relaxation, we multiply it be 
a PSD matrix of rank (at most) n. Let us consider instead the relaxation of the condition y G {—1, 1}", to 
y e R", I |y| P = n. The resulting problem is well-known: By the variational characterization of eigenvalues, 
this relaxation amounts to finding the eigenvector corresponding to the least eigenvalue of W . Let u be such 
a vector. This suggests a spectral partitioning of W that is the partition of [n] induced by u. 
We also consider what we call extended spectral partitioning: Let D be a diagonal matrix. Think oiW + D 
as the weighted adjacency matrix of a graph, with loops added. Such loops do not change the weight of any 
cut, so that regardless of what D we choose, a cut is maximal in W iff it is maximal mW + D. Furthermore, 
it is not hard to see that W is 7-stable, iffW + D is. Our approach is to first find a "good" D, and then take 
the spectral partitioning of + D as the maximal cut. These observations suggest the following question: Is 
it true that for every 7-stable instance W with 7 large enough there exists a diagonal D for which extended 
spectral partitioning solves Max-Cut? If so, can such a be found efficiently? Below we present certain 
sufficient conditions for these statements. 



4.2 Spectral partitions of stable instances 

The input to the max cut problem is a symmetric nonnegative n x n matrix W with zero trace. The support 
of is a graph G = {V, E) with vertex set V = [n] where [i, j) G E iff Wij > 0. 

Lemma 4.1. Let W be a ^-stable instance of Max-Cut with support G = {V, E). Let D be a diagonal 
matrix, and u an eigenvector corresponding to the least eigenvalue ofW + D. If ^ > "'"^ ^ then 

the spectral partitioning induced byW + D yields the maximal cut. 

Proof: As noted above, for any diagonal matrix D, the problems of finding a maximal cut W + D and in 
W are equivalent. Normalize u so that min(j j)^^; \ui ■ Uj\ = 1. (If u has any coordinates, the statement 
of the lemma is meaningless). Let D' be the diagonal matrix D[ ^ = Di^i ■ uf. Let W' be the matrix 
Wl j = Wij ■ \uiUj\. Observe that W is a 7-perturbation of W, hence the maximal cut in W' (and in 
W + D'), is the same as in W. In other words, mc* is a vector that minimizes the expression: 

mill x{W' + D')x. 

xe{-i,i}" 

Also, the vector u minimizes the expression 
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Think of u as being revealed in two steps. First, the absolute value of each coordinate is revealed, and then, 
in the second step, its sign. Thus, in the second step we are looking for a sign vector x that minimizes the 
expression: 

C^Wij ■ \Ui\Xi ■ \Uj\Xj + ^ A,^'U.^^)/||^i|P• 
^J i 

Clearly, mc* is such a vector. Since the input is stable, the optimal cut is unique, and so mc* and —mc* are 
the only such vectors. Hence, the partition they induce is the same as that induced by u. | 

Note 4.1. A more careful analysis shows a somewhat stronger result. It suffices that 

7 > ■■ • 

4.3 A sufficient condition for extended spectral partitioning 

Lemma 4.2. Let W be a ^-stable instance of Max-Cut, for 7 > 1, and let D be the diagonal matrix 
D'l^i = mc* Wijmc'j. IfW + Dis positive semi-definite, then extended spectral partitioning solves 
Max-Cut for W efficiently. 

Proof: It is easy to see that the vector mc* is in the kernel ofW + D. Since W + D is positive semidefinite, 
is its least eigenvalue, and mc* is an eigenvector of W + D corresponding to the smallest eigenvalue. 
Hence, the assertion of Lemma |4~T] holds . It remains to show that Max-Cut can be found efficiently. 

Observe that trace{D) = w{Ecut) — w{Enotcut) = 2 • w{Ecut) — w{E), where Ecut is the set of edges in 
the maximal cut, and Enotcut is the set of all other edges. Hence, to determine the value of the Max-Cut, it 
suffices to compute m = trace{D). Since mc*{W + D)mc* = 0, it follows that mc* W mc* = —m. 

We claim that m = min trace (^4) over ^4 e A, where A is the set of all positive definite matrices A such 
that Aij = Wij for i ^ j. (As we discuss in subsection 15 . 1 I below. this is the dual problem of the Goemans- 
WiUiamson relaxation (||7l).) 

That the smallest such trace is < m follows since W -\- D ^ A. For the reverse inequality note that every 
A £ A satisfies mc*Amc* = —m + trace{A). But A is positive semidefinite so trace{A) > m as claimed. 

As observed by Delorme and Poljak |4| (and, in fact already in fT\), the theory developed by Grotschel, 
Lovasz and Schrijver [ 8^ ^ around the ellipsoid algorithm makes it possible to efficiently solve the above 
optimization problem. 

Note that the solution to the optimization problem is not necessarily unique, but this can be overcome by 
slightly perturbing W at random. If W is stable, then such a modification leaves mc* unchanged. | 

If is a real symmetric matrix under consideration, we denote its eigenvalues by Ai > • • • > A„. We show 
next that if the last two eigenvalues are sufficiently small in absolute value, then the assertion in Lemma 
14.21 holds. We also recall the notation w{i) = Since w{i) can be viewed as a "weighted vertex 

degree", we denote miiij by 5 = 6{W). 

Lemma 4.3. Let W be a ^-locally stable instance of Max-Cut with spectrum Ai > • • • > A„, support G 
and smallest weighted degree 6. Let D be a diagonal matrix with Di^i = mc* Wi^jmc*. If 

26 ■ + A„ + A„_i > 0, 

7 + 1 
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then W + D is positive semidefinite. Furthermore, ifW is 7 stable for 7 > 1 then Max-Cut can be found 
efficiently. 



Proof: Let x (resp. y) be a unit eigenvectors oiW + D corresponding to the smallest (second smallest) 
eigenvalue ofW + D. We can and will assume that x and y are orthogonal. Since is an eigenvalue of 
W + D (with eigenvector mc*) it follows that x{W + D)x < 0. If we can show that y{W + D)y > 0, then 
the second smallest eigenvalue ofW + Dis positive, and this matrix is positive semidefinite, as claimed. 

By local stability, j > ^^5, so all of D's eigenvalues are at least 

Therefore 

7 - 1 ~ 

xWx < —xDx < 6. 

~ 7 + 1 

By the variational theory of eigenvalues (the Courant-Fischer Theorem), since x and y are two orthogonal 
unit vectors there holds 

An + A„_i < xWx + yWy. 

Also, 

7 - 1 ~ 

— —S < yDy. 
7 + 1 

When we sum the three inequalities it follows that 

2^^5 + A„ + A„_i < yiW + D)y. 
7 + 1 

The Lemma follows. Lemma l4!2l implies that extended spectral partitioning solves Max-Cut for W. | 



4.4 Examples of graph families on which Max- Cut can be found efficiently 



Lemma 143] gives a sufficient conditon under which the extended spectral partitioning solves Max-Cut ef- 
ficiently. In this subsection we identify certain families of graphs for which the assertion in the lemma 
holds. 

Example 4.1. Let G be a 1 + e stable, j-locally stable graph with all w{i) equal. Let Xn-i > A^ be its two 
smallest eigenvalues. Max-Cut can be found efficiently on G if 

An-l 

A„ 7 + 1' 

and e > 0. 
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Proof: By the Perron-Frobenius theorem, Ai = 6, and the all-one vector is the corresponding eigenvector. 
It also implies that (5 = Ai > |A„|. For the condition in lemma [43] to hold, it thus suffices that 

7-1 

~2 • A„ — — + A„ + A„_i > 0, 
7 + 1 

which is exactly the stated condition. | 

Example 4.2. Let G be a 1 + e stable, j-locally stable d-regular simple graph with second eigenvalue A. 
Max-Cut can be found efficiently on G if 

5d + X 

and e > 0. 

Proof: Let A be the adjacency matrix of G, and Ain the adjacency matrix of the graph spanned by the edges 
of the maximal cut. Let A^ut = A — A^n- Since G is 7-locally stable the maximal degree in A^ut^ and hence 
its spectral radius, is at most Therefore, by subtracting Aout from A, eigenvalues are shifted by at most 
this value (this follows, e.g., by Weyl's theorems on matrix spectra). In other words, the second eigenvalue 
of Ain is at most A + Since A^ is bipartite, its spectrum is symmetric, and so |A„„i(Aj„)| < A + 

Now adding Aout to Ain again shifts the spectrum by at most and so |Ari_i(A)| < A + In addition, 
by the Perron-Forbenius theorem, |An(^)| < d and so 

-{Xn{A)+Xn^liA)) <d+X+ 



7 + 1 

For the condition in lemma 1431 to hold, it thus suffices that 

,7-1 , ^ 2d 
2d ■ > d + X + 



7+1 7+1 
as claimed. | 

Example 4.3. Let G = (y, E) be a\ + e stable, d-regular simple graph with Chegger constant h. Max-Cut 
can be found efficiently on G if 

5 + - {h/df 
1 - VI - {h/df ' 

and e > 0. 

Proof: Recall that the Cheeger constant of a graph is defined as 

KG)= mm 

^ ' UCV:\U\<^ \U\ 

and provides on upper bound on G's second eigenvalue (e.g. lfT2l '): 



A2(G) < - h{GY. 
By Example l4.2I Max-Cut can be found efficiently on G. | 

Example 4.4. Let G = (y, E) be a \ + e stable, {k, "f)-distinct d-regular simple graph. Max-Cut can be 
found efficiently on G if 

5 + y/l - {k/dY 

^> 1- Vl-(A:/d)2' 

and e > 0. 
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Proof: Let {S, S) be the largest cut in G. Pick an arbitrary set [/ C V of size < n/2. We wiU derive a lower 
bound on \E{U, U)\ and therefore a lower bound on G's Cheeger constant. 

So let us consider the cut (T, T) obtained from {S, S) by swapping the position of each vertex in U. Since 

\U\ < n/2, 

min{|5Ar|, \SAf\} = min{|C/|, |C7|} = \U\. 

Now A;-distinctness implies that \E{T, f)\ < \E{S, S)\-k\U\. But every edge in ^(5", S)\E{T, f) belongs 
to E{U, U). Consequendy, \E{U, U)\ > k\U\, and since U was arbitrary, h> k. 

By Example |4. 3 I Max-Cut can be found efficiently on G. | 

5 Results derived from previous works 

5.1 Performance of the Goemans-Williamson approximation algorithm 

Let us quickly recall the Goemans and Williamson approximation algorithm for Max-Cut (T\. We first 
rephrase the Max-Cut problem as: 



Equivalently, we seek to minimize J2{ij)(^E ^ij^ij ^^^^ l}-matrices Y that are positive semi- 

definite and of rank 1. In the G-W algorithm the rank constraint is relaxed, yielding a semi-definite pro- 
gramming problem which can be solved efficiently with approximation guai^antee of 0.8786. Moreover, 
they show that when the weight of the maximal cut is sufficiently big, this guarantee can be improved. 
Namely, let (> ^) be the ratio between the weight of the maximal cut and the total weight of the edges. 
Let h{t) = arccos{l — 2t)/iT. Then the approximation ratio is at least h{R)/ R. 

By local stability, the contribution of each v ^ V to the maximal cut is the total weight of the edges 
incident with it. Summing this over all vertices, we get that the maximal cut weighs at least R = of 
the total weight. Thus, the performance guarantee of the G-W algorithm on 7-stable instances is at least 

Note that for this we only required local stability. 

The semi-definite program used in the G-W algorithm can be strengthened when the input is 7-stable, by 
inequalities that express this stability. It is interesting whether these additional constraints can improve the 
approximation ratio further. 

5.2 Spectrally partitioning random graphs 

Consider the following model for random weighted graphs. Let P be some probability measure on [0, 00). 
Generate a matrix W' (a weighted adjacency matrix), by choosing each entry j, i < j, independently 
from P. Set Wlj = Wj ^ for i > j, and VF/j = 0. Let C be the set of edges in the maximal cut of W (for 
"reasonable" P's, this will be unique w.h.p.). Set Wij = 7 • ^ for (i, j) G G. 

It is easy to see that W is indeed 7-stable, yet for certain probability measures the problem becomes trivial. 
For example, if P is a distribution on {0, 1}, the maximal cut in W simply consists of all the edges with 
weight 7. 




over y £ {-1, 1}". 
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An even simpler random model is the following. Take n even. Generate an n x n matrix W' as above. 
Choose S C [n], \S\ = n/2 uniformly at random. Let C be the set of edges in the cut {S,S). Set 
Wij = 7 • Wlj for (z, j) G C. Denote this distribution (G(n, P, 7). For an appropriate 7, w.h.p. {S, S) will 
be the maximal cut in W. This random model is close to what is sometimes known as "the planted partition 
model" (llKIllllIIOllSliKIUlIl). 

Following work by Boppana [1 ] on a similar random model (for unweighted graphs), we can deduce that 
w.h.p. the maximal cut of graphs from this distribution can be found efficiently: 

Theorem 5.1. Let P be a distribution with bounded support, expectation fi and variance cr^. There exists a 
polynomial time algorithm that w.h.p. solves Max-Cut for G £ G(n, P, 7), when 7 = 1 + 0,(^J'^^). 

The theorem follows from Lemma |4!2] and the following one, which is an easy consequence of 1 1|: 

Lemma 5.1. Let P be a distribution with bounded support, expectation /i and variance a^. Let G G 
G(n, P, 7), and S the subset chosen in the generating G. Let mc € {—1, 1}" be the indicator vector of the 

cut (S, S). Let D be the diagonal matrix defined by Di i = mc W mc. 7/7 > 1 + n{^J^^), then w.h.p.: 
L mc is the indicator vector of the maximal cut in G. 
2. W + D is positive semi-definite. 

6 Conclusion and open problems 

In this work we have shown that stability, supplemented by certain properties of the input instance, allows 
for an efficient algorithm for Max-Cut. However, if nothing is assumed about the input, we only know that 
n-stability is sufficient. Can this be improved? Note that 7 > n is very far from what happens in the random 

model, where it is only required that 7 > 1 + Q.[\J'^^). A bold conjecture is that there is some constant, 
7*, s.t. 7* -stable instances can be solved in polynomial time. 

Our motivation in defining stability and distinctness is to identify natural properties of a solution to an NP- 
hard problem, which "make it interesting", and allow finding it in polynomial time. Stability and distinctness 
indeed make Max-Cut amenable, but are in no way the only possible properties, and it would be very 
interesting to suggest others. 
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